Face Detection and Lip Localization

Authors

  • Benafsh Nadir Husain
  • Benafsh Husain
Abstract

Integration of audio and video signals for automatic speech recognition has become an important field of study. Audio-Visual Speech Recognition (AVSR) systems are known to achieve higher accuracy than audio-only or visual-only systems. Research on the visual front end has centered on lip segmentation. Experiments on lip feature extraction have mainly been performed in constrained environments with controlled background noise. In this thesis we focus on a database collected in a moving car, an environment that degraded the quality of the imagery. We first introduce the concept of illumination compensation, which tries to reduce the dependence on lighting in over- or underexposed images. As a precursor to lip segmentation, we focus on a robust face detection technique that reaches an accuracy of 95%. We detail and compare three different face detection techniques and find a successful way of concatenating them to increase the overall accuracy. One of the detection techniques used is the object detection algorithm proposed by Viola and Jones. We have experimented with different color spaces using the Viola-Jones algorithm and have reached interesting conclusions. Following face detection, we implement a lip localization algorithm based on the vertical gradients of hybrid equations of color. Despite the challenging background and image quality, a success rate of 88% was achieved for lip segmentation.

Acknowledgments

The end of this thesis marks the culmination of my academic career at Cal Poly. My experiences at Cal Poly have inspired and shaped me to be thoroughly involved in my subject and in love with what I do. I dedicate this thesis to my family, my mom Tehmina and my brother Talib. Both of you have supported me in every way possible. I also want to thank Chiweng Kam, without whose guidance, motivation and positive attitude it would have been impossible to have seen this thesis through.

I must also express my gratitude to Dr. Xiaozheng (Jane) Zhang for her guidance throughout this endeavor, her review of my work, and for the freedom in direction she allowed me to take on this project. I must also thank Dr. John Saghri and Dr. Fred DePiero for serving on my thesis committee and for fostering my interest in image processing and pattern recognition.
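
The lip localization step named in the abstract relies on vertical gradients of color-derived quantities. The thesis's exact hybrid color equations are not given on this page, so the following is only a minimal sketch that substitutes a commonly used pseudo-hue, R / (R + G), as an assumed stand-in. The function names and the toy image are hypothetical and exist only for illustration.

```python
# Sketch only: pseudo-hue R / (R + G) is an ASSUMED stand-in for the
# thesis's hybrid color equations, which are not reproduced here.

def pseudo_hue(r, g):
    """Return R / (R + G); reddish lip pixels tend to score higher than skin."""
    total = r + g
    return r / total if total else 0.0

def vertical_gradient_profile(image):
    """Difference of mean pseudo-hue between each row and the row below it.

    `image` is a list of rows; each row is a list of (R, G, B) tuples.
    """
    row_means = [
        sum(pseudo_hue(r, g) for r, g, _ in row) / len(row)
        for row in image
    ]
    return [row_means[i + 1] - row_means[i] for i in range(len(row_means) - 1)]

def strongest_edge_row(image):
    """Index i of the largest absolute change between row i and row i + 1."""
    profile = vertical_gradient_profile(image)
    return max(range(len(profile)), key=lambda i: abs(profile[i]))

# Toy face strip: three skin-colored rows above two lip-colored rows.
skin = (200, 150, 120)
lip = (180, 60, 70)
toy_image = [[skin] * 4, [skin] * 4, [skin] * 4, [lip] * 4, [lip] * 4]
edge = strongest_edge_row(toy_image)
```

On the toy strip, the strongest vertical change falls between the last skin row and the first lip row. A real implementation would run inside the detected face region and would need smoothing and thresholds that this sketch leaves out.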

Related resources

Lip Detection Based-on Normalized Rgb Chromaticity Diagram

This paper presents a new lip detection method based on the normalized RGB chromaticity diagram. The method consists of three stages: face detection, lip region localization and lip detection. The popular Viola-Jones face detection technique is employed in the face detection stage. In the lip detection stage, lip color is extracted using our novel color segmentation method that exploits the distrib...
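
For readers unfamiliar with the term, normalized RGB chromaticity divides each channel by the pixel's total intensity, so that a color is described by its proportions rather than its brightness. The sketch below shows only that conversion. The paper's own segmentation step is truncated above and is not reproduced; the function name and the convention for black pixels are assumptions.

```python
def normalized_rg(r, g, b):
    """Map an (R, G, B) pixel to normalized (r, g) chromaticity.

    r = R / (R + G + B), g = G / (R + G + B); b is implied as 1 - r - g.
    Black pixels have no defined chromaticity; returning (0.0, 0.0)
    is an assumed convention for this sketch.
    """
    total = r + g + b
    if total == 0:
        return (0.0, 0.0)
    return (r / total, g / total)

# The same color at two brightness levels maps to the same chromaticity.
dim = normalized_rg(100, 50, 50)
bright = normalized_rg(200, 100, 100)
```

Because the result is unchanged when all three channels are scaled together, the representation is less sensitive to uniform changes in illumination.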

The Combinational Use Of Knowledge-Based Methods and Morphological Image Processing in Color Image Face Detection

Human facial recognition is the base for all facial processing systems. In this work a basic method is presented for the reduction of detection time in fixed images with different color levels. The proposed method is the simplest approach in face spatial localization, since it doesn’t require the dynamics of images and information of the color of skin in image background. In addition, to do face...

Mouth Localization for Appearance-based Lip Motion Analysis

Analysis of lip motions can be deployed in a variety of applications, e.g. visual speech reading or liveness verification as part of a person authentication system. When utilizing appearance-based features to describe lip shapes (visemes), robustly detecting the position of the mouth center is an inevitable part of this task. In this paper we present an algorithm for mouth localization as part...

Vowel Recognition by Using the Combination of Haar Wavelet and Neural Network

Lip movements are important in speech recognition, and lip image segmentation has a significant role in image analysis. In this paper we present a novel technique to recognize Persian vowels. The method is based on face detection and pupil location. First we perform the lip localization, then the color spaces CIE L*U*V* and CIE L*a*b* are used in order to improve the contrast between the ...

Robust Lip Localization on Multi-view Faces in Video

In this paper, a fast and robust multi-view lip localization algorithm in video is proposed. We consider lip localization as a binary classification problem, where a classifier is learned to distinguish between the lip and the region surrounding it. The classifier we use here is a histogram-based one which exploits the anthropometrical properties of the human face with the help of face scale nor...

An Efficient Algorithm for Lip Segmentation in Color Face Images Based on Local Information

Lip detection is used in many applications such as face detection and lip reading. In previous works, researchers have considered the whole face image for lip detection. In this paper we propose a new algorithm. In our algorithm, to reduce the required calculation and increase the accuracy of detection, we do not consider the whole face image. We first remove the upper half part of the fac...

Publication date: 2011